E:\pctex\samples\fir1.dvi 02
نویسندگان
چکیده
The current speech interfaces in many military applications may be adequate for native speakers. However, the recognition rate drops quite a lot for non-native speakers (people with foreign accents). This is mainly because the nonnative speakers have large temporal and intra-phoneme variations when they pronounce the same words. This problem is also complicated by the presence of large environmental noise such as tank noise, helicopter noise, etc. In this paper, we proposed a novel continuous acoustic feature adaptation algorithm for on-line accent and environmental adaptation. Implemented by incremental singular value decomposition (SVD), the algorithm captures local acoustic variation and runs in real-time. This feature-based adaptation method is then integrated with conventional model-based maximum likelihood linear regression (MLLR) algorithm. Extensive experiments have been performed on the NATO non-native speech corpus with baseline acoustic model trained on native American English. The proposed feature-based adaptation algorithm improved the average recognition accuracy by 15%, while the MLLR model based adaptation achieved 11% improvement. The corresponding word error rate (WER) reduction was 25.8% and 2.73%, as compared to that without adaptation. The combined adaptation achieved overall recognition accuracy improvement of 29.5%, and WER reduction of 31.8%, as compared to that without adaptation. Keywords—speaker adaptation; environment adaptation; robust speech recognition; SVD; non-native speech recognition
منابع مشابه
Three molecular structures cause rhesus D category VI phenotypes with distinct immunohematologic features.
Rhesus D category VI (DVI) is the clinically most important partial D. DVI red blood cells were assumed to possess very low RhD antigen density and to be caused by two RHD-CE-D hybrid alleles. Because there was no population-based work-up, we screened three populations in central Europe for DVI. Twenty-six DVI samples were detected and examined by exon-specific RHD polymerase chain reaction wit...
متن کاملComparison Of Direct Visual Inspection (DVI) With Pap Smear In Diagnosis Of Precancerous Lesion Of Cervix
The aim of this study was to compare direct visual inspection (DVI) with Pap smear in diagnosis of precancerous lesion of cervix. A total of 1500 women were screened cytologically as well as clinically with direct visual inspection of cervix after application of acetic acid (DVI). A total of 1500 women were screened cytologically as well as clinically with direct visual inspection of cervix aft...
متن کاملDifferential variational inequalities
This paper introduces and studies the class of differential variational inequalities (DVIs) in a finite-dimensional Euclidean space. The DVI provides a powerful modeling paradigm for many applied problems in which dynamics, inequalities, and discontinuities are present; examples of such problems include constrained time-dependent physical systems with unilateral constraints, differential Nash g...
متن کاملspecials for PDF generation
DVIPDFM(x) manages various PDF effects by means of DVI specials. Appropriate documentation of DVI specials, however, is not easy to find, and exact functionality is not simple to catch without reading the source code of DVI drivers. This paper deals with the DVI specials defined in DVIPDFM(x) that are mainly used for PDF generation. We discuss the features of those specials with some examples, ...
متن کاملForensic odontology in the disaster victim identification process.
Disaster victim identification (DVI) is an intensive and demanding task involving specialists from various disciplines. The forensic dentist is one of the key persons who plays an important role in the DVI human identification process. In recent years, many disaster incidents have occurred that challenged the DVI team with various kinds of difficulties related to disaster management and unique ...
متن کاملHacking DVI files: Birth of DVIasm
This paper is devoted to the first step of developing a new DVI editing utility, called DVIasm. Editing DVI files consists of three parts: disassembling, editing, and assembling. DVIasm disassembles a DVI file to a human-readable text format (more flexible than DTL), and assembles the output back to a DVI file. DVIasm is useful for people who have a DVI file without TEX source, but need to modi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006